[SPARK-25877][k8s] Move all feature logic to feature classes. #23220

Closed
vanzin wants to merge 2 commits into apache:master from vanzin:SPARK-25877

Conversation


@vanzin vanzin commented Dec 5, 2018

This change makes the driver and executor builders a lot simpler
by encapsulating almost all feature logic in the respective
feature classes. The only logic that remains is the creation of
the initial pod, which needs to happen before anything else, so
it is better left in the builder class.

Most feature classes already behave correctly when the config has
nothing for them to handle, but a few minor tweaks were needed.
Unit tests were updated or added to account for these.

The builder test suites were simplified considerably and now only
exercise the pod-related code that remains in the builders themselves.
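
For illustration, the resulting builder shape is roughly the following. This is a simplified sketch, not the actual Spark code: Pod, FeatureStep, LabelsFeature and PodBuilderSketch below are stand-ins for the real SparkPod, KubernetesFeatureConfigStep and builder classes.

case class Pod(containers: List[String], labels: Map[String, String])

trait FeatureStep {
  // A feature that has nothing to do for the current config simply
  // returns the pod unchanged.
  def configurePod(pod: Pod): Pod
}

class LabelsFeature(labels: Map[String, String]) extends FeatureStep {
  override def configurePod(pod: Pod): Pod =
    pod.copy(labels = pod.labels ++ labels)
}

object PodBuilderSketch {
  // The builder only creates the initial pod and then folds every
  // feature step over it, in order; no per-feature logic lives here.
  def buildPod(features: Seq[FeatureStep]): Pod = {
    val initialPod = Pod(containers = List("driver"), labels = Map.empty)
    features.foldLeft(initialPod)((pod, step) => step.configurePod(pod))
  }
}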

SparkQA commented Dec 5, 2018

Test build #99687 has finished for PR 23220 at commit a13bafd.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Dec 5, 2018

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/testing-k8s-prb-make-spark-distribution-unified/5739/

SparkQA commented Dec 7, 2018

Test build #99800 has finished for PR 23220 at commit e217e56.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
  • class ArrowCollectSerializer(Serializer):

SparkQA commented Dec 7, 2018

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/testing-k8s-prb-make-spark-distribution-unified/5837/


vanzin commented Dec 11, 2018

So, anybody interested in reviewing this?

@mccheah mccheah left a comment

The main code looks a lot better, thanks! I have a concern about the tests.

val spec = pod.pod.getSpec
assert(!spec.getContainers.asScala.exists(_.getName == "executor-container"))
assert(spec.getDnsPolicy === "dns-policy")
assert(spec.getHostAliases.asScala.exists(_.getHostnames.asScala.exists(_ == "hostname")))
@mccheah mccheah Dec 11, 2018

Nit: The second call to exists can be contains instead, so that we don't pass a function object that ignores the argument. Alternatively, both exists calls can be removed:

spec.getHostAliases.asScala.flatMap(_.getHostnames.asScala).contains("hostname")

Contributor Author

I'm not modifying this code, just moving it from its previous location.

assert(metadata.getNamespace === "namespace")
assert(metadata.getOwnerReferences.asScala.exists(_.getName == "owner-reference"))
val spec = pod.pod.getSpec
assert(!spec.getContainers.asScala.exists(_.getName == "executor-container"))
Contributor

Nit: Use .asScala.map(_.getName).contains("executor-container"). Similar changes come up throughout this test.
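
For reference, the two assertion styles compare like this in a self-contained sketch; Container here is a stand-in for the Fabric8 model class, not the real type.

import scala.collection.JavaConverters._

// Stand-in for the Kubernetes Container model: just a name with a Java-style getter.
case class Container(getName: String)

object AssertionStyleSketch {
  def main(args: Array[String]): Unit = {
    val containers: java.util.List[Container] =
      java.util.Arrays.asList(Container("driver"), Container("sidecar"))
    // Style used in the test as moved: a predicate passed to exists.
    assert(!containers.asScala.exists(_.getName == "executor-container"))
    // Style suggested in the nit: map to the names, then use contains.
    assert(!containers.asScala.map(_.getName).contains("executor-container"))
  }
}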

Contributor Author

I'm not modifying this code, just moving it from its previous location.

kubernetesClient
}

private def verifyPod(pod: SparkPod): Unit = {
Contributor

The number of things this test checks is remarkable, and it is very much possible to accidentally omit checking the application of a specific feature when a new one is added for either the driver or executor. This is why we had the overridable feature steps in the original incarnation of these tests. Not mocking the substeps leads us to need to check that some specific aspect of each step has been applied. Can we go back to mocking the different steps so that this test can be more easily modified when we add more features? Or else can we abstract away the idea that these steps are applied without this test itself needing to know what the step itself actually does?
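
A rough sketch of the mocked-step approach the comment refers to, assuming the builder suite should know nothing about what real steps do; the names below (MarkerStep, BuilderSuiteSketch) are illustrative, not the actual Spark test helpers. Each stubbed step only tags the pod, and the builder test verifies only that every tag is present.

case class Pod(annotations: Map[String, String])

trait FeatureStep {
  def configurePod(pod: Pod): Pod
}

// A stand-in step that only records that it ran.
class MarkerStep(id: String) extends FeatureStep {
  override def configurePod(pod: Pod): Pod =
    pod.copy(annotations = pod.annotations + (s"applied-$id" -> "true"))
}

object BuilderSuiteSketch {
  def buildPod(steps: Seq[FeatureStep]): Pod =
    steps.foldLeft(Pod(Map.empty))((pod, step) => step.configurePod(pod))

  def main(args: Array[String]): Unit = {
    // The builder test then checks only that each step was applied.
    val pod = buildPod(Seq(new MarkerStep("basic"), new MarkerStep("secrets")))
    assert(Seq("basic", "secrets").forall(id => pod.annotations.contains(s"applied-$id")))
  }
}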

Contributor

Another factor in my concern is that, for each individual assertion, it is unclear which step the assertion is tied to. This reads a lot more like an end-to-end test than a unit test.

Contributor Author

I actually did not write this test. I copy & pasted it with zero modifications from the previous class, and I'd prefer to keep it that way.

That's also an argument for not restoring the mocks, which would go against what this change is doing. This test should account for modifications made by other steps, since if they modify something unexpected, that can change the semantics of the feature (pod template support).

Contributor

> That's also an argument for not restoring the mocks, which would go against what this change is doing. This test should account for modifications made by other steps, since if they modify something unexpected, that can change the semantics of the feature (pod template support).

Wouldn't most of those unexpected changes come from the unit tests of the individual steps? Granted, this test can catch when a change in one step impacts behavior in another step, which is important. Given that this isn't changing prior code, I'm fine with leaving this as-is and addressing it again later if it becomes a problem.


mccheah commented Dec 11, 2018

+1 from me, would like @liyinan926 to take a second look

@liyinan926 liyinan926 left a comment

LGTM


mccheah commented Dec 12, 2018

Merging to master

@asfgit asfgit closed this in a63e7b2 Dec 12, 2018
@vanzin vanzin deleted the SPARK-25877 branch December 12, 2018 20:37
holdenk pushed a commit to holdenk/spark that referenced this pull request Jan 5, 2019

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes apache#23220 from vanzin/SPARK-25877.
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019